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Abstract 

By a profound result of Heinrich, Novak, Wasilkowski, and Wozniakowski the inverse of the star- 
discrepancy n*(s,e) satisfies the upper bound n*(s,e) < c a b s s£ -2 . This is equivalent to the fact 
that for any TV and s there exists a set of TV points in [0, l] s whose star-discrepancy is bounded by 
Cab s s 1//2 iV -1 / 2 . The proof is based on the observation that a random point set satisfies the desired 
discrepancy bound with positive probability. In the present paper we prove an applied version of this 
result, making it applicable for computational purposes: for any given number q 6 (0, 1) there exists 
an (explicitly stated) number c(q) such that the star-discrepancy of a random set of TV points in [0, l] s 
is bounded by c(g)s 1,/2 TV _1 / 2 with probability at least q, uniformly in TV and s. 

1 Introduction and statement of results 

The number n*(s, e), which is defined as the smallest possible cardinality of a point set in [0, l] s having 
discrepancy bounded by e, is called the inverse of the discrepancy. Heinrich, Novak, Wasilkowski, and 
Wozniakowski [10] proved the upper bound 

n*(s,e) < c abs se -2 , (1) 

which is complemented by the lower bound 

n*(s,e) > c abs s£ _1 

due to Hinrichs [1111 (throughout the paper, c a b s denotes absolute constants, not always the same). Hence 
the inverse of the star-discrepancy depends linearly on the dimension, while the precise dependence on e 
is still unknown. It is easy to see that (Q3 is equivalent to the fact that for any TV and s there exists a set 
Vn of TV points in [0, l] s such that the star-discrepancy D* N of this point set is bounded by 

D* N (V N ) < c abs ^| (2) 
V TV 



(recently we showed that it is possible to choose c a b s = 10 in (O, see (T)). The existence of such a 
point set directly follows from the surprising observation that a randomly generated point set (that is, a 
Monte Carlo point set) satisfies the desired discrepancy estimate with positive probability. Of course, for 
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applications such a mere existence result is not of much use, as was remarked by several colleagues at the 
MCQMC 2012 conference in Sydney. For this reason, in the present paper we prove an applied version 
of (0, which provides estimates for the probability of a random point set satisfying (0 (depending on 
the value of the constant). As our Theorem Q] below shows, this probability is extremely large already 
for moderate values of c, for example for c = 20. Additionally, the quality of our estimates for these 
probabilities improves as the dimension s increases (which is somewhat counter-intuitive, and originates 
from the exponential inequalities used in the proof, which cause a "concentration of mass" phenomenon). 

The fact that the probability of a random point set satisfying (0 is very large is in contrast to the fact 
that no general constructions of point sets satisfying such discrepancy bounds are known. So far, the best 
results are a component-by-component construction of Doerr, Gnewuch, Kritzer and Pillichshammer (2), 
a semi-deterministic algorithm based on dependent randomized rounding due to Doerr, Gnewuch, and 
Wahlstrom [3], and a construction of Hinrichs lfT2l of a "structured" set of N — 256 points in dimension 
s = 15 having discrepancy less than 1/4 (by this means solving one instance of an open problem in O). 

For more information concerning the inverse of the discrepancy and tractability of multidimensional 
integration we refer to a recent survey article of Gnewuch [7|, and to the monograph of Novak and Woz- 
niakowski 11311141 . A collection of open problems on this topic can be found in |9|. 



In the present paper, we will prove the following theorem. 



Theorem 1 For any s > 1, N > 1 and q G (0, 1) a randomly generated s-dimensional point set 
(z\, ■ ■ ■ , zn) satisfies 



D s r, : Nl :,r„7 v /4.9 + l0g((1 ; g) " 1) ^ (3) 



with probability at least q. 

It is interesting that the quality of the discrepancy estimate in TheoremQ] improves as the dimension s in- 
creases; for example the necessary number c(q, s) to have star-discrepancy bounded by c(q, s)s 1 / 2 N~ x / 2 
with probability at least 90% is 15.30 in dimension s = 1, while it is only 12.65 in dimension s = 100. 
However, neglecting this advantage of large dimensions in order to obtain a result which holds uniformly 
in s, one immediately obtains the following corollary. 



Corollary 1 For any s > 1, N > 1 and q 6 (0, 1) a randomly generated s-dimensional point set 
(zi, . . . , zn) satisfies 

D* N (z u ...,z N )< 5.7^4.9 + log ((1 - q)- 1 )^ (4) 

v jv 

with probability at least q. 

TheoremQ]shows that the probability that a random point set satisfies the discrepancy bound c(q, s)s 1 / 2 N~ x / 2 
is extremely large already for moderate values of c(q,s). The following table illustrates this fact, for 
s = 10 and s = 100. 



q 


0.01 


0.5 


0.9 


0.99 


0.999 


c(q,10) 


12.62 


12.71 


12.92 


13.20 


13.48 


c(q,100) 


12.62 


12.63 


12.65 


12.68 


12.71 
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As the table shows, the probability that a random point set has "small" discrepancy in the sense that its 
discrepancy is bounded by cs 1 / 2 N^ 1 / 2 for some moderate c (for example, c = 20) is extremely large. 
This observation is an exciting counterpart of the fact that we do not have the slightest idea of how 
to construct point sets satisfying such discrepancy bounds, even for moderate N and s. It should also be 
noted that calculating the star-discrepancy of a given (high-dimensional) point set is computationally very 
difficult, see |@1[8]]. Hence, although our results show that the probability of a random point set having 
small discrepancy is very large, checking that a concrete point set satisfies such discrepancy bounds is in 
general (in high dimensions) a computationally intractable problem. 

2 Preliminaries 

Throughout the paper, s > 1 denotes the dimension and A denotes the s-dimensional Lebesgue measure. 
For x,y £ [0, l] s , where x = (xi,..., x s ) and y = (yi, . . . , y s ), we write x < y if Xi < yi, 1 < i < s, 
and for any x £ [0, l] s we write [0, x] for the set {y £ [0, l] s : < y < x}. Furthermore, we write |^4| 
for the number of elements of a set A. 

The following Lemma Q] of Gnewuch J5J Theorem 1.15] is a central ingredient in the proof of our main 
result. For convenience we use the notation from [|5] and 0: For any <5 £ (0, 1] a set T of points in [0, l] s 
is called a <5-cover of [0, 1} S if for every y £ [0, l] s there exist x, z £ T U {0} such that x < y < z and 
A([0, z)) — A([0, x)) < 5. The number Af(s, S) denotes the smallest possible cardinality of a <5-cover of 

[o,i] s . 

Similarly, for any 5 £ (0, 1] a set A of pairs of points from [0, l] s is called a (5-bracketing cover of [0, l] s , 
if for every pair (x, z) £ A the estimate A([0, z)) — A([0, x)) < 5 holds, and if for every y £ [0, l] s there 
exists a pair (x, z) from A such that x < y < z. The number Af[ ] (s, 6) denotes the smallest possible 
cardinality of a (5-bracketing cover of [0, l] s . 

Lemma 1 For any s > 1 and 6 £ (0, 1] 

M(s,5) < (2e) s (S- 1 + l) s 

and 

Af u {s,6) <2 s - 1 e s ( ( 5- 1 + l) s . 

By Lemma[T]for any 1 < k < K there exists a 2~ fc -cover of [0, l] s , denoted by Tfc, such that 

|r fc | < (2e) s (2 fc + l) s . 

Furthermore we denote by Aa' a 2~ K - bracketing cover for which 

|Ajf| < 2 s ' 1 e s (2 K + l) s , 

which also exists due to LemmaQ] Moreover we define Tk as 

— {v £ [0, l] s : (v, w) £ A K for some w}. 

By definition for every a; £ [0, l] s there exists apair (vk, wk) — {vK{x) 1 WK{x))fo^^i\\\c\\(vK 1 'WK) £ 
Ak such that vk < x < wk and 

\([o, WK ])-\([ 0)VK ])<-L. 
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Furthermore for every k, 2 < k < Kandj £ 1^ there exist Vk-x — Ufc-i (7), Wk-i = Wk-x( , y), i>fe-i, ttffc-i £ 
U {0}, such that < 7 < tOfe-i and 

A([0,u; fc _i])-A([0, «*_!])< ' 



2 fe-f 



We define 

Pk{x) = v K (x) 

Pk-i(x) = v K ^i(p K (x)) = v K -i(v K (x)) 

Pk-2(x) = v K - 2 (pk-i(x)) = v K -2(vk-i(vk{x))) 

pi(x) = v 1 (p 2 (x)), 

andpx + i(a;) = wk{x), Po{x) = 0. For x, y € [0, l] s we set 

[0,y]\[0,x] if x ^0, 

[x,y] := <( [0,y] ifx = 0, y ^ 0, 

3 ifx = y = 0. 



Then the sets [pfc(x),pfc+i(x)], 1 < k < K, are disjoint, and we obtain 

A'-l if 

|J [pfe(x),p fc+ i(x)] c [0,x] c |J [pfc(x),p fc+ i(x)], Vx€[0, l] s 

fc=0 fe=0 

Hence for every x, y £ [0, l] s 

if-i if 



fc=0 fc=0 

Moreover, independent of x, we have for < k < K 

A (\p k {x),Pk+i{x)]} < 



For < fc < K we define to be the set of all sets of the form [pfc(x) , pk+i(x)], where x £ [0, l] s . 
Then for < k < K, as a consequence of LemmaQ] we can bound the cardinality of Ak by 

\A k \ < (2er (2 fc+1 + l) s . (6) 

Note that all elements of Ak, where < k < K, have Lebesgue measure bounded by 2~ k . This dyadic 
decomposition method was introduced in 0"), where it is described in more detail. 

Let Xi, . . . , Xn be independent, identically distributed (i.i.d.) random variables defined on some prob- 
ability space (f2, A, P) having uniform distribution on [0, 1} S , and let I E Ak for some k > 0. Then 
the random variables lj(Xi), . . . , 1j(Xn) are i.i.d. random variables, having expected value A(I) and 
variance 

Am Am 2 </ 2_fe (l-2- fc ) forfc>l, 

A(/)-A(J) < I 1/4 forfc = 0. (?) 
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Since the X n are independent it follows that the random variable Yln=i l/C^n) nas expected value 
NX(I) and variance N(X(I) - A(7) 2 ). 

In the proof of our main result we need two well-known results from probability theory, namely Bern- 
stein's and Hoeffding's inequality. Bernstein's inequality states that for Z\, . . . , Zn being i.i.d. random 
variables, satisfying E Z n = and \ Z n \ < C a.s. for some C > 0, 



A' 



5> 



> t < 2 exp 



By applying this inequality to the random variables lj(X n ) — A(J), we obtain 

i 2 



A" 



£ lz(X„) - JVA(J) 



>< < 2 exp - 



2 (JVA(J) (1 - A(7))) + 2t/3 



for i > 0. Using (0 we conclude 



N 



n=l 



> t < 2 exp 



27V2- fe (l - 2- fc ) + 2i/3 



For fc 6 {0, 1} we use Hoeffding's inequality, which yields 



N 



2t 2 

I | : 2exp(- — 



for k > 2. (8) 



(9) 



p 53 ij(jr„) - jva(i) 

\ n=l 

3 Proof of Theorem [I] 

Since the theorem is trivial for N < 32 (s + log ((1 — g) -1 )) < 5.7 2 (s + log ((1 — q)^ 1 )) we assume 
that iV > 32 (s + log ((1 - g) -1 )) and set 



K 



log 2 iV-log 2 ( S + log((l- (Z )- 1 )) 



Then K > 3, and 



2-^e 



^/s + logai-q)-!) ^s + logai-g)-!) 



Furthermore we have 



2VN 



'sN = < 



N 



ffi v/ S + log((l-g)-i)' 
By choosing t = cvsN for some c > 0, we conclude from (0, (01 and (fTTT) that for any c > 



(10) 



(11) 



< 



AT 

J2Wn)-NX(I) 

n=l 

2e- 2c2s 
2 exp 



2-2- fc (l-2- fc ) + 



3y'» + log((l-g)-l) 



for k = 0, 1 
for 2 < k < K. 



(12) 
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Let B}., k = 0, . . . , K be given as 



B k = |J 



ieA k 



N 



£ l,(X n ) - NX(I) 



> c k VsN j 



(13) 



The strategy of the proof is to find for any given q G (0, 1) constants = c/s (g), k = 0, . . . , K for which 



A' 



fc=0 



holds for any given g. 

First we consider the case k = 0. By © we have that | Aq| < (6e) s . We choose 



Co 



1 + log(6) , log (8(1 -g)- 1 ) 1 

~~ 2— + S ^7f A/ 



log((l- g )-i) 



thus together with ( TT~2T > and (TPTb it follows that 

P(B ) < \A \2e~ 2c o s < (6e) s 2e-^ 1+log ( 6 »^-^ = 

8 4 

Furthermore we get by (0 that |Al| < (10e) s and with 



J l + logqo) + log (8(1 -g)- 1 ) < J_ / 5 39 + log^l-g)- 1 ) 
2 2s y2 V s 



we obtain that 

P(Bi) < |^4i|2e- 2c ? s < (10e) s 2e- s(1+log(10)) 
Next we consider the case 2 < k < K.By ( fT2l and (Tl3T > we have 

/ 

P(B fc ) < |A fc | -2-exp 



(l-q) 1-q 



els 



V 



2 • 2- fc (l - 2- fc ) + 



S^/s+logCCl-g)-!) / 



We set 



(14) 



(15) 



(16) 



« . ^ + lo g(2 (2« + 1)) + ^l&M f^V*|g 
hence we get that 



< 



^ + log ((1 -<?)-!) 

^/l + log(2(2 fc + 1 + 1)) + log(2( fc + 1 )) + ^SSL ^-i)- 1 ) ^2 ■ 2- fc (l - 2~ fc ) + 2m ^ 
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< 



^Jl + log(2(2 fe + 1 + 1)) + log(2( fe + 1 ))y /, 2 • 2- fe (l - 2- fe ) + 



2.08 • 4 • 2- K 



<2.08 

for 2 < k < K. Thus by ([161) we obtain 



'(Bfc) < |A fc | -2-exp - 



2 • 2- fc (l - 2- fc ) + 



3^s+log((l-g)-i) 

< (2e) s (2 (fc+1) + l) s • 2 ■ exp (-s (l + log(2(2 (fc+1) + 1)))) 
1-9 



1-g 

2 (fc+i) 



2 fc ■ 

Summing up the estimated probabilities gives 



fc=0 V fe=3 / 



Therefore with at least probability q, a realization Xx (w), . . . , X n (w) is such that 



K 



i U s *- 



fc=0 



We denote by z n a point set which is defined by such a realization, i.e. 



A" 



z n = X n (uj), 1 < n < N, for some u) £ B^. 



k=0 



Set A fc = y/2 ■ 2- fe (l - 2- fc ) + 2.08 • 4 • 2~ K /3. Then 

a a 



fe=2 k=2 
A 



Ec fe = EA fe A/l + log(2(2'=+i + l)) 



log(2( fe + 1 )(l-g)- 1 ) 



fc=2 
A 



< ^AfcWl + log2 + log(2 fe + 1 ) + 0.12• 



log(2(*+l)) , logai-g)- 1 ) 



fe=2 



< 5^A fc Jl.l2 + (2* + 3)log2- 



log((l-g)-i) 



< \ 1.12 + 7 log 2 



log ((1-g)- 1 ) 1 



K 



2_j v^Afc 



fc=2 



< 3.28W5.98 



Iog((l-g)-i) 



Therefore we obtain by using (fT4l . (TT~5T > and ( fTvT > 



A 



fc=0 



V2 



£ c fc <^^/4.88 + ^((^g)- 1 ) + 4=^/5.39 + l0g - ^ 



V2 



(17) 



3.28^5.98+ l0g((1 ; g) ' 1} (18) 
Applying ©, ( [Tot . ( fT8l and Jensen's inequality we obtain 

N K N 

X Mo,x] (Zn) < X X 1 [p fc ( a; ),p fc + i( a; )]( Z ™) 
n=l fc=Ora=l 

< ATA([0,iu K (z)])+ViiVX c * 

K 

< NX([0,x}) + NX([x,w K (x)]) + VTN^Ck 

k=0 

< NX([0, x}) + n ^ + ^-^ I + ^NJ2 



Cfc 

fc=0 



< ATA([0. ) + 5.7\/4.«) + l0g((1 - g) ^ Vf.V. 
Similarly a lower bound is given by 

N K-1 

X l[o,*]On) > ATAflp.pj^a;)]) - X ^ 

n=l k=0 



> NX([0, x}) - 5.7W4.9 + — ^-^VTN. 



Combining the above bounds we finally arrive at 



,y , , r - • , , l0 g((! -<?) _1 ) 

DjV^i,...,^) < 5.7\/4.9 
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